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Abstract 

In this paper, a new state-of-the-art multi-cell MMSE scheme is proposed for massive MIMO 
networks, which includes an uplink MMSE detector and a downlink MMSE precoder. The main novelty 
is that it exploits all available pilots for interference suppression. Specifically, let K and B denote the 
number of users per cell and the number of orthogonal pilot sequences in the network, respectively, 
where /3 = B/K is the pilot reuse factor. Then our multi-cell MMSE scheme utilizes all B channel 
directions, that can be estimated locally at each base station, to actively suppress both intra-cell and 
inter-cell interference. The proposed scheme is particularly practical and general, since power control for 
the pilot and payload, imperfect channel estimation and arbitrary pilot allocation are all accounted for. 
Simulations show that significant spectral efficiency (SE) gains are obtained over the single-cell MMSE 
scheme and the multi-cell ZF, particularly for large /3 and/or I\. Furthermore, large-scale approximations 
of the uplink and downlink SINRs are derived, which are asymptotically tight in the large-system limit. 
The approximations are easy to compute and very accurate even for small system dimensions. Using 
these SINR approximations, a low-complexity power control algorithm is also proposed to maximize 
the sum SE. 
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I. Introduction 

Multi-user multiple-input-multiple-output (MU-MIMO) communication has drawn consider¬ 
able interest in recent years. By scheduling multiple users to share the spatial channel simulta¬ 
neously, the spatial degrees of freedom offered by multiple antennas can be exploited to focus 
signals on intended receivers, reduce interference, and thereby increase the system data rate [fTll— 
|f6l . These features make MU-MIMO incorporated into recent and evolving wireless standards 
like 4G long-term evolution (LTE) and LTE-Advanced [0. 

Massive MU-MIMO, or very large MU-MIMO, is an emerging technology that scales up 
MU-MIMO by orders of magnitude I®, 10. The idea is to employ an array comprising say 
a hundred, or more, antennas at the base station (BS) and serve tens of users simultaneously 
per cell. Compared to the contemporary cellular systems, the system SE can be drastically 
increased without consuming extra bandwidth 0-0. Uplink and downlink transmit power 
can also be reduced by an order of magnitude since the phase-coherent processing provides a 
comparable array gain lITOl . In the limit of an infinite number of antennas, intra-cell interference 
and uncorrelated noise can be averaged out by using simple coherent precoders and detectors, 
and the only performance limitation is pilot contamination and the distortion noise from hardware 
impairments m, ED- Furthermore, in time division duplex (TDD) mode, the channel training 
overhead scales linearly with the number of users, instead of the number of BS antennas, which 
allows for adding antennas elements without affecting the training overhead |fl2l . These features 
make massive MIMO one of the key technologies for the next generation wireless communication 
systems. 

In the uplink reception and downlink transmission, the most common linear processing schemes 
are matched filtering (MF), zero forcing (ZF) and minimum mean square error (MMSE)[] Let 
B denote the number of orthogonal pilot sequences that are available in the network, and K 
denote the number of users in each cell. We can then define (3 = B/K > 1 as the pilot reuse 
factor, since only 1//3 of the cells use the same set of pilots. In conventional massive MIMO 
systems, the BS first listens to the uplink pilot signalling from its own cell, estimates the K 
intra-cell channels and then constructs its transceiver processing based on the channel estimates 
to mitigate the intra-cell interference [fl3ll — llT6l . However, parts of the inter-cell interference can 
also be suppressed when f3 > 1. If the BS is aware of all pilot sequences, then it can locally 
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estimate B channel directions by listening to the pilot signalling from all cells instead of only 
from its own cell. Since its K users only occupy K out of the B channel directions, the BS is 
able to select its user-specific detectors in the uplink to suppress interference from other cells, and 
design precoders in the downlink to mitigate interference leakage to other cells. Based on similar 
observations, some multi-cell detection and precoding schemes have been proposed in [fl6l - [|T9ll . 
In [22, a multi-cell ZF detector (referred to as full-pilot ZF detector in ifTTlO is proposed, which 
exploits and orthogonalizes all available directions to mitigate parts of the inter-cell interference. 
It achieves a higher SE than the conventional ZF when the interfering users are near to the 
edges of the surrounding cells. In general cellular networks, however, the gain is less obvious, 
partly due to the loss in array gain of B in multi-cell ZF, instead of K as with conventional 
ZF. Uplink multi-cell MMSE detectors are proposed in lfl6ll and llT9fl . but the former is limited 
to — 1 and equal power allocation, and the latter is based on the unrealistic assumption that 
perfect channel state information (CSI) is known at BS. The multi-cell MMSE precoder proposed 
in Ifl8l brings a notable gain over single-cell processings. However, like lfT6l . this scheme does 
not account for arbitrary pilot allocation which, as shown in IfTTl . is an important way to suppress 
pilot contamination and achieve high system SE in massive MIMO deployments. Moreover, no 
closed-form performance expressions are provided in [|T8l . 

In this paper, a new state-of-the-art multi-cell MMSE transceiver scheme is proposed, which 
includes an uplink MMSE detector and a downlink transmit MMSE precoder. The novelty of 
the multi-cell MMSE scheme is that all B pilots are exploited at each BS to actively suppress 
both intra-cell and inter-cell interference. Power control for the pilot and payload, imperfect 
channel estimation and arbitrary pilot allocation are all accounted for in our scheme. Numerical 
results show that significant SE gains can be obtained by the proposed scheme over conventional 
single-cell schemes and the multi-cell ZF from ItlTl . and the gains become more significant as (5 
and/or K increase. Furthermore, large-scale approximations of the uplink and downlink SINRs 
are derived for the proposed multi-cell MMSE scheme, which are asymptotically tight in the 
large-system limit. The approximations are easy to compute since they only depend on large- 

1 A special case of the downlink MMSE precoder is the regularized ZF (RZF) precoder, which is obtained when all the users 
in a cell have equal pathlosses it201 . Since this is generally not the case in cellular networks, RZF provides lower performance 
than the MMSE precoder and is not considered in this paper. 
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scale fading, power control and pilot allocation, and shown to be very accurate even for small 
system dimensions. Based on the SINR approximations, a low complexity iterative power control 
algorithm for sum SE maximization is proposed for the multi-cell MMSE scheme. Compared 
to the equal power allocation policy, our proposed algorithm significantly improves the system 
sum SE and also provides good user fairness. 

The paper is organized as follows: In Section UH we describe the system model and the 
construction of the multi-cell MMSE transceiver. Large-scale approximations of the uplink and 
downlink SINRs are derived in Section ITTlJ Based on the SINR approximations, a low complexity 
iterative power control algorithm is proposed in Section [TV) Simulation results are provided in 
Section [V] before we conclude the paper in Section EO All proofs are deferred to the appendix. 

Notations : Boldface lower and upper case symbols represent vectors and matrices, respectively. 
The trace, transpose, conjugate, Hermitian transpose and matrix inverse operators are denoted 
by tr(-), (-) T , (•)*, (•) H and (-) -1 , respectively. 

II. System Model and Transceiver Design 

We consider a synchronous massive MIMO cellular network with multiple cells. Each cell is 
assigned with an index in the cell set £, and the cardinality |£| is the number of cells. The BS 
in each cell is equipped with an antenna array of M antennas and serves K single-antenna users 
within each coherence block. Assume that this time-frequency block consists of T c seconds and 
W c Hz, such that T c is smaller than the coherence time of all users and W c is smaller than 
the coherence bandwidth of all users. This leaves room for S = T c x W c transmission symbols 
per block, and the channels of all users remain constant within each block. Let hj ik denote the 
channel response from user k in cell l to BS j within a block, and assume that it is a realization 
from a zero-mean circularly symmetric complex Gaussian distribution: 

h jik ~ CM (0, dj (z i k ) l M ). (1) 

The vector z //c e R 2 is the geographical position of user k in cell l and dj(z) is an arbitrary 
function that accounts for the channel attenuation (e.g., path loss and shadowing) between BS 
j and any user position z. Since the user position changes relatively slowly, dj(zi k ) is assumed 
to be known at BS j for all l and all k. 

We consider a TDD protocol in this paper, where the downlink channels are estimated by 
uplink pilot signaling by exploiting channel reciprocity. In TDD mode, each transmission block 
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is divided into two phases: 1) uplink channel estimation phase, where each BS estimates the CSI 
from uplink pilot signalling which occupies B out of S symbols in each block; 2) uplink and 
downlink payload data transmission phase, where each BS processes the received uplink signal 


and the to-be-transmitted downlink signals using the estimated CSI. Let and (' ,n denote the 


fixed fractions allocated for uplink and downlink payload data transmission, respectively. These 


fractions can be selected arbitrarily under the conditions that c ul + C dl = 1 and that ( n \S - B) 


and C il (S — B ) are positive integers. In what follows, the uplink channel estimation is first 
discussed to lay a foundation for the transceiver design. 

A. Uplink Channel Estimation 

In the uplink channel estimation phase, the collective received signal at BS j is denoted as 


Y j £ C MxB where B is the length of the pilot sequences (it also equals to the number of 


orthogonal pilot sequences available in the network). Then Yj can be expressed as 

K 

y , = ££ VPlk h jlk^I lk + ( 2 ) 

lec k =i 

where h j tk is the channel response defined in CD), Pik > 0 is the power control coefficient for the 


pilot of user k in cell l, and Nj £ C MxB contains independent and identically distributed (i.i.d.) 


elements that follow CM (0, a 2 ). We assume that all pilot sequences originate from a predefined 
orthogonal pilot book, defined as V = {vi,..., v B }, where 



b\ = b 2 , 
b\ ^ b 2 , 


(3) 


and let iik £ {1,...,£>} denote the index of the pilot sequence used by user k in cell l. 
Arbitrary pilot reuse is supported in our work by denoting the relation between B and K by 
B = / 3K , where (3 > 1 is called the pilot reuse factor. If the pilots are allocated wisely in the 
network, a larger j3 brings a lower level of interference during pilot transmission, known as pilot 
contamination. 

Based on the received signal in ©, the MMSE estimate of the uplink channel is lUTTl 


h iik = VMkdj (z i k ) Yj (^*) 1 v* k , 


(4) 


where v h 7 is the covariance matrix of the vectorized received signal vec(Yj) and is given by 

K 

*3 = Ejvec (Yj) vec (Yj) "} = ££ Pimdj {z £m ) T 0"“I B- 

t€C m =1 


( 5 ) 
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According to the orthogonality principle of MMSE estimation, the covariance matrix of the 
estimation error h,-^. = h j lk — h j ik is given by 


C jlk = E |hj 7 fch" fc | = dj (z /fc ) (l - pikdj (z /fc ) 1 v ilfc ) I M - 

By utilizing that 


V^T 1 = 

y Hk3 v- Wv 


EteC Em=l (z<m)v^Vi, m + a 


V *ifc a jilk V ii k i 


x lHk 


where ctj ilk is a scalar, the estimation error covariance matrix in © can be expressed as 


( 6 ) 

(7) 


Cjik dj ( Zik ) (1 Pikdj (zik) a.ji lk B) I m- 


( 8 ) 


As pointed out in [fTTl . the part Y ; ('!'* j“ 1 v* of the MMSE channel estimate in © depends 
only on which pilot sequence that user k in cell l uses. Consequently, users who use the same pilot 
sequence have parallel estimated channels at each BS, while only the amplitudes are different 
in the estimates. To show this explicitly, define the M x B matrix 


H 


v,j 


h VJ i,...,h V js =Yj (^*) 


which allows the channel estimate in © to be reformulated as 


(9) 


hj/fc \JPikdj Hy_jO 


Hk > 


( 10 ) 


where e, denotes the Ah column of the identity matrix I b- The property that users with the same 
pilot have parallel estimated channels is utilized to derive new SE expressions in the sequel. 

Notice that the estimated channel is also a zero-mean complex Gaussian vector, with its 
covariance matrix *&ji k e C MxM being 

®jik — dj (z ik) I m ~ Cjik = Pikd] (z ik) oij ilk Bl M . (11) 

Define the covariance matrix of hy Jfl ; as Then according to (flOl) and (fill) . &v.ji = oijiBl M . 


B. Uplink Multi-cell MMSE detector 

After the uplink channel estimation, during the uplink payload data transmission phase, the 
received signal y j e C Mxl at BS j is 

K 

y j = \ZUkh-jikXik + cij, 

lec k =l 


( 12 ) 
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where is the transmit power of the payload data from user k in cell l, Xu- ~ CJ\f( 0,1) 
is the transmitted signal from a Gaussian codebook, and n, ~ CJV(0, a' 2 I M ) is additive white 
Gaussian noise (AWGN). Different symbols are used for pilot power and payload power to allow 
for different power control policies for them. Denote the linear detector used by BS j for an 
arbitrary user k in its cell as g j k , then the detected signal Xjk is 


(13) 


Xjk = gfkYj = V^sfk^jkXjk + g fk + gf k nj. 

By using the following achievable ergodic SE can be achieved for this user lU3l 

*5 = C" 1 (l - f) E{h 0)} {log 2 (1 + 4 ) } , (14) 

where ^ denotes the expectation with respect to all the channel estimates obtained at BS 
j, and the SINR rf- ] k is given by 

„ 2 

g fk h jjk 


Vfk 


Tjk 



\ 


2 2 

E < 

r jk 

g fkhjjk 

+ E T lm |g^h itm | + cr 2 


l 

(l,m)4(j,k) 


l (i) 


T jkgfk^jjk^fj k gjk 


(15) 


g fk I T jkCjjk + Xi r lm + Cjlm) + 


g jk 


where E{-|h-(j)} denotes the conditional expectation given all the estimated channels at BS j. Due 
to that the imperfectly estimated channels are available, the SE in (fl4l) is achieved by treating 
gfJ^jjk as the true channel, and treating uncorrelated interference and channel uncertainty as 
worst-case Gaussian noise [|T3ll . Thus, is a lower bound on the uplink ergodic capacity. 

The second line of Eqn. (IT5l) shows that the uplink SINR takes the form of a generalized 
Rayleigh quotient. Therefore, a new multi-cell MMSE (M-MMSE) detector can be derived to 
maximize this SINR for given channel estimates: 

K K \ -1 

EE Ti k hjikhf lk + EE TikCjik + o~ 2 l M ) h jj k . (16) 


M-MMSE 

g jk 


JGC k =1 l&C k =1 

As the name suggests, this detector (with an appropriate scaling) also minimizes the mean square 
error (MSE) in estimating Xjk lf2Tl : 


E 


x jk\ |h(j) 


(17) 









By plugging ([8]) and (fTOl) into (fT6l) . the M-MMSE detector can also be expressed as 


g 


M-MMSE 

•jk 


= (H v , i A j H» j +{a 2 + Vj )I M y 1 h 


1 jjki 


( 18 ) 


K 


where A j = YhYh r ikPikdj(^ik)^ii k % k is a diagonal matrix, and its ?'th diagonal element \ 3i 

leL k=l 

depends on the large scale fading, the pilot and payload power of the users that use the ith pilot 

K 

sequence in V. The scalar ipj is defined as ipj = S T ikdj{zi k )( 1 — pi k dj(zi k )aj ilk B), where 

Zg£ k =1 

aji lk is defined in Eqn. CO). 

To elaborate the advantages of our M-MMSE scheme, we compare it with some related work. 
First, the conventional single-cell MMSE (S-MMSE) detector from [|T3ll - [fl5l is 

K 


-l 


S—MMSE 

s .jk 


^ 1 T jrrS l jjrrS l jjm T" T & ^-M 


1 jjki 


(19) 


\m=l 


where inter-cell interference is either ignored by setting Z j = 0 or only considered statistically 
as with 

f K 

:h , x • x ■ - U UM k ( 20) 


Z j — ® 


E t h 


T jrnhjlrnhj lm 

K m= 1 l^j m— 1 

Notice that the S-MMSE detector in (fl9l) is not a pure single-cell detector if Z y in (l20l) is used, 
since statistical information about the multi-cell interfering channels is utilized in Zj. We refer 
to it as “single-cell” detector because it only utili z es the K estimated channel directions from 
within the serving cell, and treats directions from other cells as uncorrelated noise. In comparison, 
all the B available estimated directions in Hyy are utilized in our M-MMSE detector so that 
BS j can actively suppress also parts of inter-cell interference when B > K. Therefore, our 
detector can actually maximize the SINR in while the S-MMSE can only do this in single¬ 
cell scenarios. The M-MMSE scheme can be seen as a coordinated beamforming scheme, but 
since there is no signalling between the BSs (BS j estimates Hyj from the uplink pilots), the 
M-MMSE scheme is fully scalable. 

Compared with the multi-cell MMSE scheme proposed in lfl6l and lfT9ll . our detector is more 
practical and general. To begin with, power control and any fractional pilot reuse policy are 
supported in our scheme, which allows for an analysis based on a more flexible and practical 
network deployment. It is shown in lUTj that in massive MIMO systems, fractional pilot reuse is 
an important way to suppress pilot contamination and achieve high system SE. Furthermore, the 
uplink detector in lfT9l is based on the unrealistic assumption that perfect CSI is known at each 


BS, while imperfect channel estimation is accounted for in our detector. Thus the performance 
gains provided by our detector are actually achievable in practical systems. This makes our new 
M-MMSE detector the state-of-the-art method for massive MIMO detection. In Section ITTlJ an 
explicit large-scale approximation expression of the SINR in (fl5l) is provided, which allows 
for simple performance analysis and the design of resource allocation schemes without time- 
consuming Monte Carlo simulation. 


C. Downlink Multi-cell MMSE Precoder 


During the downlink payload data transmission, the received signal at user k in cell j is 

K 

Djk = 'y ^ h Ijk y ' \/~Qi rri w lrri, s lm. + Kjk, (21) 

IdC m= 1 

where w i m G C Mxl is the precoder used by BS l for user m in its cell, si m ~ CJ\f(0, 1) is the 
payload data symbol for user m in cell l, 0 [ rn is the corresponding downlink transmit power 
coefficient, and rij k ~ CA/"(0,1) is AWGN. 

Recently, an uplink-downlink duality for massive MIMO systems was established in ifTTl 
which proves that for a proper downlink power control, the uplink SEs can be achieved also 
in the downlink if each downlink precoder is a scaled version of the corresponding uplink 
detector. Since the M-MMSE detector proposed in the Subsection III-BI is the state-of-the-art 
uplink method, we apply the same methodology for downlink precoding. The downlink M- 
MMSE precoder is constructed as 


„ M-MMSE 
W jk 


y M—MMSE 

3 jk _ 

\/ 7 jk 


( 22 ) 


where 7 = E{||g^T MMSE || 2 } normali z es the average transmit power for the user k in cell j 
to E{|| x / ftm w“r MMSE s, m || 2 } = Qi m . Since there are no downlink pilots in the TDD protocol, 
the users do not know the current channel but can leam their statistical equivalent channels, 
.yoJ/TE{hj {Ii -j k w :/ /,.}, and the total interference variance. Consequently, a downlink SE 

B' 


R% = r (1 - 


5 


log 2 (1 + vfk) 


(23) 


can be achieved for user k in cell l 0, uni, where rjf k is 

Qjk | E (h} {hf jk w jk }[ 


7 fk 


E E Qim^ih} \\h^ jk w lm \ 2 \ - g jk |E {h} {hf jk w jk }\ 2 + a 2 

lCCm=l ^ J 


(24) 
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This downlink SINR holds for any linear precoding scheme, and we omit the superscript “M- 
MMSE” of w j k for brevity. The SE in ([23b is achieved by treating E ih i as the true 

channel, and treating interference and channel variations as worst-case uncorrelated Gaussian 
noise. Thus, Rf k is a lower bound on the downlink ergodic capacity. 

By utilizing all the available estimated directions, the M-MMSE precoder can suppress intra¬ 
cell interference and also reduce the interference caused to other cells, and thus a higher SINR can 
be expected by our precoder than conventional single-cell precoders, at least for an appropriate 
power control [fTTIl . In the next section, a large-scale approxmiation of the downlink SINR 
in m is derived. In lfl8l . the authors also proposed a multi-cell MMSE precoder which brings 
a notable gain over single-cell processing, but it does not accounted for arbitrary or optimized 
pilot allocation. Moreover, no closed-form performance expression is provided in lfl8ll . 

Looking jointly at the uplink and downlink, the ergodic achievable SE for user k in cell j is 

Rjk = ( x f) ( cu1e K)} { log2 + r/ S)} + ^ dllog 2 i 1 + ’nfk)) • (25) 

III. Asymptotic Analysis 

In this section, performance analysis is conducted for the proposed multi-cell MMSE scheme. 
Since the uplink SINR in dT3b depends on the stochastic channel estimates in each block, the 
uplink SE in (fl4l) cannot be computed in closed form. Therefore, a deterministic equivalent 
expression for the SINR is computed instead which is tight in the large-system limit. A large- 
scale approximation of the downlink SINR is also provided. The large-system limit is considered, 
where M and K go to infinity while keeping K/M finite. In what follows, the notation M —* oc 
refers to K, M -A oc such that limsup m K/M < oo and liminf m K/M > ()[] Since B scales 
with K for a fixed (3, limsup m B/M < oo and liminf m B/M > 0 also hold for B. The 
results should be understood in the way that, for each set of system dimension parameters M, 
K and B, we provide large-scale approximative expressions for the uplink SINR and downlink 
SINR, and the expressions are tight as M, K and B grow large. The main feature is that they 
only depend on the large-scale fading, power control and pilot allocation, and can be computed 
efficiently without the need for time-consuming Monte Carlo simulations. In what follows, the 
notation - a ' s —> denotes almost sure convergence of a stochastic sequence, and -» denotes 

M—> oo M—> oo 
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convergence of a deterministic sequence. 

Before we continue with our performance analysis, two useful results from large random 
matrix theory are first recalled in the following subsection. All vectors and matrices should be 
understood as sequences of vectors and matrices of growing dimensions. 


A. Useful theorems 

Theorem 1 (Theorem 1 in ll22lD : Let D G C MxM be deterministic and H G C MxB be random 
with independent column vectors h fe ~ CN(f). ; jjRb)- Assume that D and the matrices R/,(7i = 
1,..., B ), have uniformly bounded spectral norms (with respect to M ). Then, for any p > 0, 


M 


where T(p) G C AIxM is defined as 


tr (D (HH» +pl„) _1 ) - — tr(DT(p)) 


M—>oo 


GO, 


(26) 


R t 


T(p) + 4W 


+ P i 


M 


(27) 


and the elements of S(p) = [5i(p),..., 5 B (p)] T are defined as 5 b (p) = lim^oo <5^(p), b = 1,..., B, 
where 




R,, 


M 


M 1 + Sf ^ (p) 


+ Pl 


N 


(28) 


for t = 1, 2,..., with initial values = 1/p for all b. 


Theorem 2 (From Il22l0 Let © G C MxM be Hermitian nonnegative definite with uniformly 
bounded spectral norm (with respect to M). Under the same conditions on D and H as in 
Theorem HI 

A tr (d (HH“ + pi,,)- 1 0 (HH« + pI M )f - A tr (DT (p)) 0 (29) 

where T'(p) G C MxM is defined as 

T' (p) = T (p) ©T (p) + T (p) i- V 2 T (p). (30) 

K J K J y > J M ( i + 5 b ( p )) 2 


2 The limit superior of a sequence x n is defined by limsup^a;,, = lim (sup {xm : m > n}); the limit inferior is defined as 

n — y oo 

liminf n a; n — lim (inf {xm ■ m ^ n}). 

n—>oo 
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T(p) and 8(p) are defined in Theorem [0 and S'(p) = S[(p),S' B t(p)] T is calculated as 

d' (p) = (Ib — J (p)) -1 v (p) (31) 

where J(p) and v(p) are defined as 

[J (P)]« = 


iftr (RfeT (p) R^T (p)) 


M (1 + 8 i ( p)Y 


, 1 <b,l < B, 


[v (p)] b = jjU- (RtT (p) ©T (p)), 1 <b<B. 


(32) 

(33) 


5. Large-scale Approximations of the SINRs with the M-MMSE scheme 

In what follows, we derive the deterministic equivalent of with the M-MMSE detector, 
and the large-scale approximation f}f k of pf k with the M-MMSE precoder, such that 

^- 4 ^ 0 - ( 34 ) 


Theorem 3 For the uplink M-MMSE detector in (fl8l) . we have //"l — //"l — a ' s —> 0, where 

J J M->oo J 

is given by 


Vfk 


TjkPjktf (z jk ) 6 2 k 


E TimPimd,) (Z i m ) + E T lmdj (z lm ) 


P 

u jk ^ ■unt-un—j 

(l,m)A(jA),i lrn =ijk 


with 


Sjk = — tr f 


M 


■ v,ji jk J-j 


Pjimk = Yytr (T' fc ) - pi m dj (z iro ) A 2 + XjHm ^ 3lm 


M 


(1 + A jilt'd jlm) 


(35) 


djlm 


djlmk * 1 


= ^ tr 


where 
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1) T j = Tj(a) and (5(a) = [5 1 ,...,S B } T are given by Theorem Q] for a = a ~^ Pj and R-b = 

^jb&V.jb- 

2) T' jk = T ' jk (a) and S' (a) = [<5j, S' B ] T are given by Theorem|2]for a = © = ®v,ji jk , 

and R b = \ jb $ V:jb . 

3) T" jk = T j k (cn) and S'(a) = [<5(, ■ ■■,d' B ] T are given by Theorem |2] for a = - jp- , © = 1 M , 
and R fe = \ jb <t v ,jb- 

Proof: See Appendix |Bj ■ 


Theorem 4 For the downlink M-MMSE precoder in (l22l) . we have rfjl — r/) 1 ), 
^ is given by 


-» 0, where 

M—>oo 


vfk 


5 2 

QjkPjkdj (z jk) fjn- 

jk 


Pjk Qlmd t (Zjfc) 


lm 


di (z jk/ 

m^ l 3 k 


l-t'ljkn 

Mti” 


(36) 


+ 


M 


where 3 /m , pi-jkm and t)" m are given in Theorem [3} 


Proof: See Appendix O ■ 

By utilizing Theorem [3] and 0] the ergodic SEs /?)'). in (fl4l) and /?(’). in (1231) . after dropping 
the prelog factor (1 — j), converge to Rf k = log 2 (l + fjf k ) and Rf k = log 2 (l + rjf k ) in the 
large-system limit, respectively. Therefore, a large-scale approximation of the joint ergodic SE 
in (1251) is provided by (1 — f )(C ul -R)i + C dl RyD- This approximation is easy to compute and only 
depends on the large-scale fading, power control and pilot allocation. As shown in Section 0 
this large-scale approximation is very accurate also at small system dimensions. 


C. The Uplink and Downlink Duality for the M-MMSE scheme 

It is pointed out in iFTTlI that when the precoder is a scaled version of the detector, like (l22l) 
in our case, the same per user SEs as in the uplink can be achieved in the downlink by properly 
selecting the downlink payload power. We establish this uplink-downlink duality for our M- 
MMSE scheme, using the large-scale SINR approximations given by Theorem [3] and Theorem [4] 


Theorem 5 For the proposed M-MMSE scheme, if rf-), in (1351) is achievable in the uplink for 
user k in cell j, then a downlink power control policy { o ]k ) can be obtained by transforming 
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K K 

the corresponding uplink power {Tj k } according to Eqn. (78), such that Y T jk = Y Y Qjk 

j£Ck= 1 jGCk =1 

and that the same SE is achieved in the downlink, i.e., r/jl = rf-\. 

Proof: See Appendix [01 ■ 

Note that Theorem [5] establishes the duality for the large-scale SINR approximations, instead 
of the real SINRs. However, since the approximations are very accurate even for small system 
dimensions, Theorem [5] provides a powerful tool to obtain a judicious downlink power allocation 
whenever the same SEs are desired in both the uplink and downlink. 

IV. Iterative Power Control 

The large-scale approximations of the uplink and downlink SINRs given in Theorem [3] and 
Theorem |4] not only enable us to evaluate the system performance without time-consuming Monte 
Carlo simulation, but they also enable us to improve the system performance by optimizing key 
system parameters based on only large-scale fading. In this section, we consider optimizing 
the uplink payload transmit power jointly for the multi-cell network to maximize the weighted 
uplink sum SE. Since the downlink payload power can be obtained according to Theorem [5] the 
optimized uplink SEs can be achieved also in the downlink using the same total transmit power. 
The effectiveness of our proposed power control algorithm is testified in Section [V] 


A. Joint Uplink Power Control for Weighted Uplink Sum SE Maximization 


The power control for sum SE maximization has been widely studied in cellular networks lf23l - 
m, an d here we consider this sum SE metric for the proposed M-MMSE detector. Using the 
same notations of D, F and r defined in Appendix iDl and define the vector r = [//'(j,..., fj^ K ] T € 
R L Kxl , then the uplink SINR approximation in (1331) can be expressed as 


A = Vfk 


riD ltl 


W + s 


where (-)j denotes the /th element of the corresponding vector and l 


(37) 

k + (j — 1 )K. Using the 


notation in (1371) . we want to find the power control that maximizes the weighted SE as 

LK 

V : maximize E £i !og 2 (i + n) 

i=i 


S.t. 0 < Ti < Pmax') 
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where P ma x is the maximum radiated transmit power of each user and & > 0 is the weight for 
the corresponding user. All ^ = 1 corresponds to conventional sum SE maximization, while 
other values can be used to enforce some fairness. However, as proved in [[3Tj| , power control 
problems for sum SE maximization are strongly NP-hard. Thus lower bounding of log 2 (l + ft) 
by log 2 (r/) is often used to approximate V as V\ lf32l . If33l : 



LK 

V 2 : 

maximize 

T,q 

LK 

n« 

V 1 : 

maximize ^ £/ log 2 (r ; ) 


1 ( K 

1=1 

2 


T 1=1 

s.t. 

it 

a z 

Tj - 

3 M 

s.t. 

0 <Ti< P max VI. 


\i=i 




0 <r t < P max , VI. 


For fixed F and D, by introducing the auxiliary vector q with its Zth element qi < rf, 
problem V\ can be turned into the geometric programming (GP) problem Vo above. The optimal 
solution of Vo can be obtained numerically, for example, using the convex optimization toolbox 
in MATLAB. A low-complexity fixed point iteration method is also proposed in [f33ll to solve 
problems of the same type as Vo. With our notation, the power coefficient 77 is updated as 


' LK 


T i (t + 1 ) = min Ui/ — 


Cj F j,i (t) rj U ) 


P 

1 1 17 


(38) 


Dj(t)Tj(t) 

where t is the iteration index in the fixed point algorithm, for t — 0,1,.... It is proved in 
that starting from the initial point 77 ( 0 ) = P ma x for all l, the above algorithm converges at a 
geometric rate to the optimal solution of V\ (for fixed F and D). 


In our case, however, F and D are not fixed since 5j k , Hji m k and d" ]k will change as 77 changes. 
Hence, V 2 in our work is not a pure GP. Therefore, Algorithm |T] is proposed to iterate between 
solving Vo for fixed F and D, and updating F and D using the current r. 

In step 3, the matrices F, D, the current power Tj and the SINR r 3 of all users in the 
network are needed at each BS. Thus Algorithm [Tj involves some information exchange among 
the BSs. However, since the asymptotic approximation only depends on long-term parameters, 
the information exchange overhead is much smaller than if the sum SE would be maximized in 
every coherence block based on the current small-scale fading. Moreover, the proposed algorithm 
only involves simple calculations and converges quickly, thus it is of low complexity. Since the 
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Algorithm 1 : Approximated Sum SE Maximization Power Control Algorithm 

l: Initialize: 77 (0) = P max for all l, t = 0 and select e > 0. 

LK 

2 : Calculate F(£), D(t) and R(t) = ^&log 2 (u) using r(t). 

i=i 

3: Update r(t + 1) by (1381) . and calculate R{t + 1) based on the newly updated r(t + 1) and 
the F(t) and D(£) in step 2, until \R(t + 1) — R(t)\ < e. 

4: Update the time slot index t with t + 1. 

5: Repeat step 2-4 until R(t) converges. 


convergence has been proved in If33ll for fixed F and D, and we improve them in each iteration, 
our algorithm converges to some local optimal solution of V\. 

V. Simulation Results 

In this section, we illustrate the analytical contributions by simulation results for a symmetric 
hexagonal network topology. We apply the classic 19-cell-wrap-around structure to avoid edge 
effects and guarantee the consistent simulated performance for all cells; see Fig. 1. Each hexag¬ 
onal cell in the network has a radius of r = 500 meters, and is surrounded by 6 interfering 
cells in the first tier and 12 in the second tier. To achieve a symmetric pilot allocation in this 
hexagonal cellular network, the pilot reuse factor can be f3 e {1, 3,4, 7} as shown in Fig. 1. For 
each pilot reuse policy, the same subset of pilots are allocated to the cells with the same color, 
and pilots in each cell are allocated randomly to the users. 
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Fig. 1. The 19-cell-wrap-around hexagonal network topology for /3 = 1, /3 = 3, /? = 4 and /? = 7. 

The user locations are generated independently and uniformly at random in the in cells, but 
the distance between each user and its serving BS is at least 0.14r. For each user location 
z e M 2 , a classic pathloss model is considered, where the variance of the channel attenuation 
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is dj{ z) = || z _^ ) || K . The vector b ? e M 2 is the location of the BS in cell j, k is the pathloss 
exponent, and || • || denotes the Euclidean norm. C\ 7j) > 0 is independent shadow fading for some 
user location z with 10 log 10 (C( z) ) ~ A/^Cfcr 2 sf ). In the simulation, we assume k = 3.7, a 2 f = 5 
and the coherence block length S = lOOOjj 

A. Benefits of the proposed M-MMSE scheme 

In this subsection, we show the benefits of our M-MMSE scheme over the conventional 
alternatives. Statistical channel inversion power control is applied to both pilot and uplink payload 
data, i.e., pik = Tik = dl ' 7jlk) lfl7l . Thus during the uplink phase, the average effective channel 
gain between users and their serving BSs is constant: E{p ifc ||h Hfc || 2 } = E{r ifc ||h Hfc || 2 } = Mp. 
Then the average uplink SNR per antenna and user at its serving BS is p/cr 2 . This is a simple 
but effective policy to avoid near-far blockage and, to some extent, guarantee a uniform user 
performance in the uplink. For downlink payload data transmission, the transmit power g ik is 
selected according to Theorem [5] to achieve the same downlink SE at each user as in the uplink. 
In our simulation, p/o 2 is set to 0 dB to allow for decent channel estimation accuracy, and the 
time proportions for the uplink and downlink are set to (' ul = £ dl = 

To verify the accuracy of the large-scale approximations from Section [Till 10000 independent 
Monte-Carlo channel realizations are generated to numerically calculate the joint achievable 
SE in (1251) . The numerical results and their large-scale approximations from Theorem [3] and 
Theorem 0 are shown in Fig. [2] As seen from Fig. [2l the achievable sum SE per cell increases 
monotonically with [3 for the considered range of values. This is due to the following two 
properties. Firstly, a larger (3 results in a lower level of pilot contamination, contributes to 
a higher channel estimation accuracy, and thereby increases the achievable SE. Secondly, a 
larger (3 indicates more available estimated channel directions in the construction of the M- 
MMSE detector and precoder, thus a higher inter-cell interference suppression can be achieved. 
Moreover, Fig. [2] shows that the numerical results and the large-scale approximations match very 
well, even for small M and small K. 

To show explicitly the advantages of our M-MMSE scheme, simulation results for the matched 
filter (MF) from J8), the multi-cell ZF (M-ZF) scheme from lITTll . and the S-MMSE scheme 


3 This coherence block can, for example, have the dimensions of T c = 10 ms and W c = 100 kHz. 
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Fig. 2. Achievable sum SE as a function of the number 
of antennas M, for p € {1, 3, 4, 7} and K = 10. 



Fig. 3. Achievable sum SE of M-MMSE (squares), M-ZF 
(triangles), S-MMSE (diamonds) and M-MF (circles) with 
P = 1, K = 10 and K = 30. 



Fig. 4. Achievable sum SE of M-MMSE (squares), M-ZF 
(triangles), S-MMSE (diamonds) and M-MF (circles) with 
3 -- 4. K = 10 and K = 30. 


Fig. 5. Achievable sum SE of M-MMSE (squares), M-ZF 
(triangles), S-MMSE (diamonds) and M-MF (circles) with 
p = 7, K = 10 and K = 30. 


from (fl9l) are provided for comparison. The same downlink power acquisition from Theorem |5] 
and normalization from (l22l) are applied for all precoders. Notice that M — j3K > 0 is needed for 
the M-ZF scheme, thus the minimum value of M for the M-ZF is (3K +1. Simulation results are 
shown in Figs. [3] — 15] for /3 — 1, (3 — 4 and >3 = 7, respectively. The MF scheme always achieves 
the lowest performance since it does not suppress any interference. Compared to the S-MMSE, 
our proposed M-MMSE always achieves a higher sum SE, and the advantage becomes more 
significant as (3 and/or K increases. For 6 = 4 and M = 200, the SE of M-MMSE are 31% and 
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53% higher than those of S-MMSE for K = 10 and K = 30, respectively. For (3 = 7 , the gains 
increase to 42% and 82% for K = 10 and K = 30, respectively. The advantage of the M-MMSE 
over the M-ZF is only minor for small (3 and small K, but the gain becomes notable as (3 and 
K grow. Since the complexity of our M-MMSE scheme is the same as for the M-ZF, and the 
M-ZF can sometimes achieve very low SE for small M, in general our M-MMSE scheme is the 
better choice if high system SE is desirable. 

B. Effectiveness of the joint power control scheme 

In this subsection, the effectiveness of the power control scheme proposed in Section IV is 
testified. Since it has been shown in the previous subsection that the proposed M-MMSE scheme 
performs better than the conventional techniques, especially for large (3, we focus on the M- 
MMSE scheme in this subsection. Statistical channel inversion power control pi k = dl ^ lk ) ’ s still 
applied for pilots, while the uplink payload data power r ]k is optimized, p/a 2 is still set to 0 
dB and the maximal transmit power P ma x in P is selected to make the cell edge SNR (without 
shadowing) equal to -3 dB. Results for the equal power allocation (i.e., Tu, = Pmax ) is provided 
as a base line. We also apply Algorithm |Tj to the instantaneous SINR in (IT5l) for comparison. 
The following results are obtained for M = 100 and K = 10. After generating user locations 
and shadow fading, 9 users with the worst channel conditions in the whole network are dropped 
to provide 95% coverage. 

We first consider the performance metric of average user SE which is calculated as the network 
sum SE divided by the number of served users. The cumulative distribution functions (CDFs) 
over user locations are shown in Fig.|6]and Fig.|Tjfor (3 = 4 and (3 = 7 , respectively. As seen from 
the figures, the CDF curves with long-term power control based on Algorithm Q] coincide with 
those with short-term power control optimized for the instantaneous SINR at every coherence 
block, which validates our power control based on the large-scale SINR approximation. Since 
the approximation only depends on the long-term statistics, the optimization complexity can be 
spread over time. Furthermore, compared with the equal power allocation policy, the average 
user SEs can be significantly improved by our power control scheme. At the 50 percentile, 17% 
increase can be achieved by our scheme for both (3 = 4 and (3 = 7. 

We analyze how the per user SE at different parts of the cells is affected by our power control. 
Results are also provided for the power control proposed in fl34}, which tries to provide equal 
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Fig. 6. CDFs of average user SE with /3 = 4. 



Fig. 8. CDFs of per user SE with /? = 4. 



Fig. 7. CDFs of average user SE with /3 = 7. 



Fig. 9. CDFs of per user SE with j3 = 7. 


SE for users in the same cell so that, to some extent, intra-cell user fairness is guaranteed. 
CDFs of the per user SE are shown in Fig. [8| for (3 = 4 and in Fig. [9] for (3 = 7. Equal power 
allocation leads to the largest SE variations, while the power control from [f34l gives relatively 
small variations. Interestingly, the proposed power control from Algorithm [T| provides essentially 
the same SE for the weakest users, while pushing the SE of the majority of the users to higher 
values. Despite the larger SE variations, we conclude the proposed power control brings a better 
type of user fairness than the scheme from lf34l since the strong users get higher SEs without 
degrading for the weakest ones. 
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VI. Conclusions 


In this paper, a new state-of-the-art multi-cell MMSE scheme is proposed, which includes an 
uplink M-MMSE detector and a downlink M-MMSE precoder. Compared with the conventional 
single-cell MMSE scheme, that only makes use of the intra-cell channel directions, the novelty 
of our multi-cell MMSE scheme is that it utilizes all channel directions that can be estimated 
locally at each BS, so that both intra-cell and inter-cell interference can be actively suppressed. 
The proposed scheme brings very promising sum SE gains over the conventional single-cell 
MMSE and the multi-cell ZF from liTTTl . particularly for large (3 and K. Since imperfect es¬ 
timated CSI is accounted for in our scheme, the gains obtained by our scheme are likely to 
be achievable in practical systems. Furthermore, large-scale approximations of the uplink and 
downlink SINRs are derived for the proposed multi-cell MMSE scheme, and these are tight 
in the large-system limit. The approximations are easy to compute since they only depend on 
large-scale fading, power control and pilot allocation, and shown to be very accurate even for 
small system dimensions. Based on the SINR approximations, an uplink-downlink duality is 
established and a low complexity power control algorithm for sum SE maximization is proposed 
for the multi-cell MMSE scheme. The proposed power control brings a notable sum SE gain 
and also provides good user fairness compared to the equal power allocation policy. Since the 
SINR approximations depend only on long-term statistics, the complexity of the power control 
algorithm can be spread over a long time period. 


Appendix A 


Useful Lemmas 


Lemma 1 (Matrix inversion lemma (I), [13511 1: Let A e C MxM be a Hermitian invertible matrix. 
Then, for any vector x e C Mxl and any scalar r e C such that A + rxx /f is invertible, 



(39) 


Lemma 2 (Matrix inversion lemma (II), llT3i ): Let A e C MxM be a Hermitian invertible matrix. 
Then, for any vector x e C Mx 1 and any scalar r G C such that A + rxx /f is invertible, 



( 40 ) 
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Lemma 3 (Generalized rank-1 perturbation lemma, Lem. 14.3 in 0611 1: Let A e C MxM be 
deterministic with uniformly bounded spectral norm (with respect to M ) and B G C MxM be 
a random Hermitian matrix, with eigenvalues Af < ... < A® such that, with probability one, 
there exist e > 0 and M 0 such that Af > e for all M > M 0 . Then for any vector v e C Mx 1 , 

■y-tr (AB _1 ) —-p-tr ("a(B + vv 5 ) -1 ) —j- 0 (41) 

M K M V V ' / M-xx> 

where B 1 and (B + vv”) 1 exist with probability one. 


Lemma 4 (Lem. B.26 in 071 . Thm. 3.7 in P6l . Lem. 12 in P8l ): Let A e C MxM and x, 
y ~ CA'fO, ^Im)- Assume that A has uniformly bounded spectral norm (with respect to M) 
and that x, y and A are mutually independent. Then, for all p > 1, 

1) E{|x«Ax->(A)n = o(J F ; 

2) x^Ax - -jtr (A) 0 

M Af-> oo 

3) x^Ay 


M—too 


4 0 


4) E 


x iJ Ax)“ - (ifr (A))' 


M—> oo 


-> 0. 


Appendix B 
Proof of Theorem [3] 

Define X,- = (Hy^AyHy ■ + (cr 2 + ^-) Im) _ 1 , then the M-MMSE detector in |T8] is g j k = 
'Ejhjjk. We omit the superscript “M-MMSE” in the proof for brevity. In the following proof, we 
use x to denote the almost sure convergence such that a x b represents a — b — a,s —> 0. Define 

M—xx) 


-1 


1 ) H VJ ik - h V ji, hvjfe-i), h V y(i ifc +i),..., h VJB 

2) A jik diag (Aji, ...Aj(j ifc _i), Aj(j ifc _|_i), —, Aj_b), 

2) ^jjk = jj k + (c + </2j) I M 

4) Sj = MTjj and = AfSjjk, 

then we have the following lemma. 


Lemma 5 Let and h 

error, respectively, then 


jik denote the MMSE estimate of h jik as in ( 117)1) and its estimation 




Pjkdj (zj/j) 5jk 

1 + A ji jk 5jk 


a.s. 

M—>• oo 


>o, 



S,h 




0 

M—> oo 


(42) 

(43) 
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Proof: Let x = h I j I jk 'Sjhjj k , then 

X = jjk + 

(а) Pjkdj ( z j/c) ^VJij^jjk^VJijk _ JpPjkdj { z jk) ^Vyjijk^jjk^VJijk 

1 + ^jijk^VJijk^nk^-VJijk 1 + ^ji j kit* l vji. k 'Ej jk hv,ji jk 

( б ) IlPjktf (z ifc ) tr ^ jjPjktf (z jk ) tr ($ V ji ifc S i) 

(d) JJP.iP 1 ] ( Z jfc) tr (^V,ji jfc T j) (e) Pjkdj (Zjk) 5jk 

1 + ^yijk T^ftx ^ \i]k^jk 

where (a) follows from Lemma HI and the fact that h 1]k = y/pfkdj(z 3k )h V:3ijk and (6) follows 
from Lemma |4] 2). Notice that Lemma |4] 2) can be applied since 'E' jjk = + 

CT Im) - 1 has uniformly bounded spectral norm as M -)■ cxi, because p 3 scales as A' and 
A > 0 by assumption, thus jj > 0 for all M. (c) follows from Lemma [3 (d) follows from 
Theorem H] for D = <t Vji]k , T. ; = T, ( ,J' ! ) with R/, = A j h & V j b - In step (e), we use the 
notation 8 jk = -ptr^y^T,) and arrive at the expression in (42). 


Let y = hfj k T,jhji m , then 

y ~ i^jjk + ^jijk^VJijk^VJijk) ^jim 

(a) y/Pjkdj (z jk) hyji.^jjkiljlm _ y/Pjkdj (Zjk) Jj hy j i jk S jjj^jlm (ft) ^ ^ 

1 + ^jijk^Vjiji^jjk^VJijk I + 'VIT ^V../i , k ^/.■ ^ 1 V ,jij a- 

where steps (a) and b follow from Lemma Q] and Lemma [4] 3), respectively, which completes 
the proof. ■ 

We use this lemma in the following to determine the asymptotic behaviour of each term in 
the uplink SINR of (fl5T) . 


A. Signal power 

Since g^.h jjk = then according to Lemma 13 it is obvious that 

Pjkdj ( z jfc) djk a.s. „ 

1 + A jij k Ojk M^-OO 

By the continuous mapping theorem ll39ll . we further obtain 

2 _ / Pjkd) (zjfe) 8 jk \ 2 a . a . ; Q 
V 1 + ^ jijjjk / 




(46) 


( 47 ) 
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B. Channel uncertainty 
According to Lemma [5] 




H 


M —>-00 


A 0. 


(48) 


Thus by the dominated convergence theorem Il40ll and the continuous mapping theorem, we have 


E i T jk gpMjjk 


l (i) 


M—>• oo 


-> 0. 


(49) 


C. Interference power 

Since gf k = Y,jhjj k , the interference power from user m in cell l is E{ h | { |gj / / : li J 7 m | 2 |li [ y)} = 
E{|hE fc S J h J ; m | 2 |h( J )}. The computation depends on which pilots that are used. 

1) km — ijk = *o-' In this case, user k in cell j use the same pilot sequence as user m in cell 
j, and there will be coherence pilot contaminated interference. Since 


i /- j ( _ /Plm dj (z Im) £ 

D-jlm ~ \/Plm a j [plm ) n Vji 0 — \j --- 7 n jjk, 

Pjk Clj [Zjk) 


(50) 


we have 




Plm dj { z lm) CH U I (2) 1 ( \J( \ \/PjkPlm^jk 

Pjk dj (Z jk ) + hjjtEjhflm ~ dj (Z jk ) dj (z lm ) { + ’ ( 51 ) 

where in step (a) the first term remains and the second term vanishes according to Lemma [5] 
Indicated by the dominated convergence theorem and the continuous mapping theorem, we have 


E 




2 L 1 j2 ( W2/ \ PjkPlmSjk 

h(j) > - dj (z jk )dj (zim) 


(l + ^jijk^jk) 


M—> oo 


A 0. 


(52) 


2) km 7 ^ ijk-' In this case, two users have different pilots, such that 

2 




( a ) ,2/ \ M 2 ^V,jijk ^jjk^-jlm^jlm^jjk^VJijk 

= p jk dj (Z jk ) 


(b) 


p jk d 2 j (z jk ) 


I 4" 


M 2 tr ( 


(l + A ji jk Sjk) 


— Pjkdj ( Zj k ) 


= p jk d] (z jk ) 


(l + A ji jk Sjk) 

^-jlm^jjk^V.j i.j k ^jjk hj l m 

(l + A jijjjk) 


(53) 
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where step (a) follows from Lemma [Hand the definition of £(- fe . Step ( b ) follows Lemma [4] 2), 
Lemma [3] and Theorem [H It remains to obtain a deterministic equivalent of the numerator in 

j,jk,lm = jj k — ^jilrn^V ,jii n 


Define Y , ,/,/,„ = — A,,,. li v li(f) 1 , then according to Lemma[2]we have 


Yjjk Y 


j,jk,lm 


v \ ii y 

1 


Plugging (1541) into the numerator of (1531) . we obtain 


(54) 


^jlm^jjk^VJijk^jjkhjlm ~ ^ l jlm^j,jk,lm^V,ji jk ^ j,jk,lm h 


H 


*-jlm 


-2 Re 


v 

(intf. 1) 


\ uH V ^ V Vi Vi# V Vi 


1 + ^jkrn ^V, ^j,jk,lmhv,j 


V 


+|A. 




(intf. 2) 




3Hr\ 


1 + A,j, hv 


jkm n V,jii m ^jJkJmMVjiir, 


(55) 


v 

(intf. 3) 


Deterministic equivalent of (intf. 1): Define Y!-- klm = MYjjk,im, then following similar 
procedures as before, it is straightforward to show that 

dj (zir 


^jlm^j,jk,lm^V,jijk ^ j,jk,lm\^jlm 


M 2 

dj (z im ) 

M 2 


tr £ 


y 


tr ( 


dj ( Zlm) ( 

M 2 


tr(T jfc ), (56) 


<r +V>j 


M 


, D — I M , 0 — 4> 


vjq-fc 


and 


where T ;fc = T( fc (a) is given by Theorem [2] for a = 

R& = A jb&vjb- 

Deterministic equivalent of (intf. 2): Instead of tackling the expression in (intf. 2) directly, 
we derive the deterministic equivalents of its numerator and denominator, respectively. Plugging 
h jim = h jim + hj/m and hji m = s /p^dj{ z im )h V ji Ira into the numerator, we have that 

40 \fPlmdj (z/ r 


^jlm^jjk,lm*&V,jijk^-‘j,jk,lm3 l V,jii rl 


M 2 

W yfPlmdj (Z lm ) 

M 2 

\X Plmdj (z, m ) 


tr 

tr 


'djlmk 
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(57) 
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by defining 'd' ]lmk = j^ti(^ V j ilri T'- k ) e R. Step (a) follows from Lemma 0] 2) and Lemma [3] 
Step (6) follows from Theorem [2] Similarly, we have 

y/Plmdj (z i r 


:h 




M 

yjPlmdj (z ir 


-tr $ 




C.H y y^H 


tr | jj yjPlmdj (z; m ) 1? jlrrn (58) 

: djim, (59) 


M 

where fijim is defined as 'dji rn = 3ytr(<E> V jj im T ; -) e R and T ? is given in Lemma \5\ Based 
on (1571) - (l59l) . the equivalents of the denominator and numerator are given as 1 + A ji lm dji m 
and j-jd'jimk'djimPimdj (z /m ) X , Jllm , respectively. Therefore, according to the continuous mapping 
theorem, we have 

djlmk'djlm Plmdj (z i m ) Xji 


(intf. 2) 


'JHm a.s. 


■> 0. 


(60) 


1 + film'd jlm. M M^o o 

Deterministic equivalent of (intf. 3): Based on the techniques used to characterize (intf. 1) 
and (intf. 2), it is straightforward to show that 


Plugging ([56]), 




(intf 3) I djim | dj lmk pi m dj (z/ m ) a s ^ ^ 

(! + X jkmdjlm ) 2 M 

and ([6Tb into (1531) . we have that 


(61) 


Pjkdj (Z j k ) ( dj (Z i m ) 


(1 + A M 

pjkdfj (z jk ) dj (z i 

m ) Pjlmk 


Plmdj (z im) \ji lm dj lmk djl r 


2 + A jii m djlm 

(1 + A ji lrn djl m ) 


(l + djk) M 

where pji mk = ^tr(T'- fc ) - Pimdj(zi m )X jilrn d' jlmk dj lrn is defined. Consequently, 


(62) 


we 


have by the dominated convergence theorem that 


E 




h (j) > ~ 


Pji-dj (z j k ) dj (z I m) f^jlmk a.s. 


(1 + X j i jk S jk )M 


-> 0. 


(63) 
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D. Noise power 


The noise term in (IT31) is scaled by 11 g 7 /, 11 2 for which we have that 


Sjk\\ Pjkdj { z jk ) 


(a) 


M ' 2 ,jijk '^jjk^jjkd l Vjijk 
1 + 


( 6 ) 


p jk d 2 j (z jk ) 


tr ( $ 


vjijk L jk) (c) p jk d 2 (z jk ) d jk 


(1 + *hM 2 M* (1 + A jiik 5 jk ) 2 M' 


(64) 


where step (a) follows from Lemma Q] and step (6) follows from Lemma 0] 2), Lemma [3] and 
Theorem [2j T" jk = T ” k (a) is given by Theorem |2] for a = <L pp-, © = I m, D = ®v,ji jk , 
and R fe = A j b <f> V j b . In step (c), d"- k = jjtr(^ V j ijk T" jk ) is defined. Then by the dominated 
convergence theorem, we have 


E 




Pjkdj (gjfc) p k a , s , t 
(1 + A jiJjtfM 


(65) 


Finally, by the continuous mapping theorem, we have 

ul_ Tj k p jk dj (z j k ) 6j k _ a s ^ ^ 

3k E T lm p lm dj (z lm ) + £ T lm dj(Z lm ) PP + M^oo 

which completes the proof. 


( 66 ) 


Appendix C 
Proof of Theorem [4] 

Except for the channel variance varjhj^w j k } = E{ lij( A .w ; /, — E{h^ fc w jfc }| 2 }, large-scale 
approximations of the signal power and the interference in (l24l) can be calculated by follow¬ 
ing similar procedures as in Appendix [BJ Thus, only the variance of the effective channel is 
considered here and we show that it goes to zeros as M —> oo. 

Define c = hf jk Sjh jjk , c = ¥.{h? jk E j h jjk }, and b = hf jk Z j h jjk , then 

var { h f jk w jk } = ^-E {h} {| hf jk g jk - E { h f jk g jk } | 2 } 

= — E {|c - c + b\ 2 } = — E{(c - c) (c + c)} + — E {|6| 2 } , (67) 

r fj k 'Yj k 'Yj k 

where the last step is due to the fact that h ]jk is independent of h jjk and that E { h) = 0. 
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From step (a) of Eqn. (1441) . we have 

Vjkdj i z jk) ji "Sjjkhyji jk pjkdj (zjfc) ^ Y]jjkhv,ji jk Pjkdj i^jk) A a 

c =- ^^7 -- —7 -<- ^ -= 0. (68) 

1 + ^jijk^VJijk i l i k 

Therefore, c < 9 and same bound also holds for c. Thus we have 

o Q 1 

var {hjjfcWjfc} < —E{|c-c|}H-E{|6| 2 }. (69) 

7 jk Ijk 

It is shown by Lemma \5\ that c — ( . Zjfc )S f —0. Since c and c are bounded, this implies by 
the dominated convergence theorem that E{|c — c|} —» 0 as M —> oo. 

Furthermore, we have that 

— T 2 ® — T^tr (^jjkCjjk), (70) 

t j r j 

where step (a) holds because X,- 7 2-1 M (where A ■< B means that B — A is positive semi- 
definite). Since ip 2 scales as K 2 or equivalently as M 2 , and tv(&jj k Cjjh) scales as M, we have 
that E{|6| 2 } —> 0 as M —> oo. Consequently, 

var {h^w, fc } > 0. (71) 


Appendix D 
Proof of Theorem [5] 

To prove Theorem 0 we first rewrite the large-scale approximation of SINR fjjl in a more 
tractable way by only having one index instead of two. Define a diagonal matrix D e C LKxLK 

as 

Pjk d 2 (z jk ) 6 2 k 


and a matrix F e £LKxlk 


D 


k+(j-l)K,k+(j-l)K 


Fi 


k+(j-l)K,m+(l-l)K 


= 


if 

V jk 

o, 

S %Pl m m ) 

«£ ’ 

dj fJ'jlmk 

M'd"-, ’ 


if (/, m) = (j, k ), 

if iim = ^ (j,k), 

if ilm 7^ fki 


(72) 


(73) 


respectively, where [-]jj represents the element in the zth row and the jth column of the 
corresponding matrix. 
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Furthermore, define the vectors r = [r n , ..., tlk] T £ M LA xl and the scalar / = k+(j — l)K G 
{1, LK }, then the uplink SINR approximation in (1351) can be expressed as 

nDu 


Vfk 


(Ft), + 


(74) 


M 


where (•)/ denotes the /th element of the corresponding vector. Furthermore, define the diagonal 
matrix \F ul = diag{r)“j,..., f]' kK \ G C LKxLK and the vector 1 = [1,..., 1] T G ~R LKxl , then the 


above r]' d k can be written in matrix form as 


Dr = \F ul Fr + — ^ ul l. 

M 


(75) 


Using the same notations from (172b and (1731) . and defining the vector g = [p n ,..., Qlk] t € 
i LKxl , the downlink SINR approximation in (l36l) can also be expressed in matrix form as 


Dg = V dl F T g + —'F ai l, 

M 


a 


dli 


(76) 


where v F' n = diag {Vni ■ ■ ■ ,Vu<} ff LKyLh i s a diagonal matrix. By setting the downlink 
SINRs equal to the uplink SINRs as V F " 1 = v F' n = \F, then the uplink and downlink powers 
must satisfy 


2 

T= (D — ^F) -1 fi/1, 

M ' 


(77) 

(78) 


if (D — fi'F) is invertible. From the above two equations, it is straightforward to show that 
1 T g = T l 1, which means that the total power is the same. Moreover, after selecting the uplink 
power r according to some performance metric, we can compute the matrices D, V F and F 
and these will make (D — fi'F) invertible. We can then obtain the downlink power g according 
to (1781) . In this way, the same sum SE as in the uplink is achieved in the downlink. 
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